Abstract: Low-complexity carrier frequency offset (CFO) estimation and compensation in multi-user massive multiple-input multiple-output (MIMO) systems is a challenging problem. The complexity of existing CFO estimation algorithms increases tremendously with the number of base station (BS) antennas, $M$, and the number of user terminals (UTs), $K$ (i.e., the massive MIMO regime). In this paper, we address this problem by proposing a novel low-complexity algorithm for CFO estimation which uses the pilot signal received at the BS during special uplink slots. The total per-channel use complexity of the proposed algorithm increases only linearly with increasing $M$ and is independent of $K$. Analysis reveals that the CFO estimation accuracy can be considerably improved by increasing $M$ and $K$ (i.e., the massive MIMO regime). For example, for a fixed $K$ and a fixed training length, the required per-user radiated power during uplink training decreases as $1/\sqrt{M}$ with increasing $M$.

I. Introduction 

Massive multiple-input multiple-output (MIMO)/large scale antenna system (LSAS) has emerged as one of the possible key technologies for the next generation 5G cellular wireless network because of its significant improvement in energy and spectral efficiency over conventional communication technologies [1]. In massive MIMO systems the base station (BS) is equipped with a large number of antennas (in the hundreds) to simultaneously serve a few tens of autonomous user terminals (UTs) in the same time-frequency resource [2]. Increasing the number of BS antennas, $M$, provides array gain, which reduces the energy requirement for transmission [1], [3]. These results, however, assume coherent operation between the UTs and the BS. In practice, the carrier frequency offsets (CFOs) between the carrier frequency of the signal received at the BS and the BS oscillator can severely impact coherent detection of data, thus degrading the system performance. A substantial amount of work on CFO estimation and optimal pilot design for the same in conventional MIMO systems has been carried out in the past decade [4]-[8]. However, because their complexity increases prohibitively with the number of BS antennas and UTs, these algorithms are not suitable
for implementation in massive MIMO systems. In [9], for multi-user massive MIMO systems, the authors propose an approximation to the joint ML estimator for the CFOs of all the UTs. The approximation still requires a multi-dimensional grid search and is therefore expected to have high complexity for a large number of users. Also, [9] only considers a frequency-flat channel.

In this paper, we address the problem of CFO estimation in a frequency-selective multi-user massive MIMO system operating in time division duplexed (TDD) mode. The contributions of the work presented in this paper are: (i) we propose a simple uplink training scheme which satisfies the optimality criterion for pilot design [4], [5] (with respect to minimizing the Cramer-Rao Lower Bound (CRLB) on CFO estimation); (ii) we propose a low-complexity multi-user CFO estimation algorithm at the BS and derive an analytical expression for its mean square error (MSE). Exhaustive numerical simulations reveal that the proposed CFO estimator performs close to the CRLB for sufficiently high per-user radiated power; (iii) analysis of the MSE expression reveals that for a fixed $K$ and fixed training length, the required per-user radiated power to achieve a fixed desired MSE decreases as $1/\sqrt{M}$ with increasing $M$, provided that $M$ is sufficiently large (i.e., massive MIMO regime); (iv) for a fixed and sufficiently large $M$ (i.e., massive MIMO regime) and fixed training length, the required per-user radiated power to achieve a fixed desired MSE decreases with increasing $K$ (as long as $K$ is much smaller than the training length); and (v) the total per-channel use complexity of the proposed estimator is $O(M)$ and is independent of the number of UTs, $K$. Therefore the required per-user radiated power to achieve a fixed desired MSE can be reduced by increasing $K$ (fixed $M$ and fixed training length), without any increase in the total per-channel use complexity. To the best of our knowledge, for frequency-selective massive MIMO systems, there is no other work which has analytically studied the impact of increasing the number of BS antennas and the number of UTs on the required per-user radiated power to achieve a fixed desired MSE.

Notations: $\mathbb{C}$ denotes the set of complex numbers. $\mathbb{E}$ denotes the expectation operator. $(\cdot)^H$ denotes the complex conjugate transpose operation, while $(\cdot)^*$ denotes the complex conjugate operator. Also, $\mathbf{I}_N$ denotes the $N \times N$ identity matrix.

II. System Model 

We consider a massive MIMO system with $M$ base station (BS) antennas, communicating simultaneously with $K$ autonomous single-antenna user terminals (UTs) in the same time-frequency resource. Massive MIMO systems are expected to operate in Time Division Duplexed (TDD) mode, where each coherence interval is divided into an uplink (UL) slot, followed by a downlink (DL) slot. In the UL, the BS uses pilot signals transmitted by UTs for channel estimation. These channel estimates are used to facilitate coherent detection of user data at the BS receiver. In the DL, the same channel estimates are used to beamform information symbols to the UTs.

Fig. 1 The communication strategy, depicting allocation of separate UL/DL TDD slots for CFO estimation and compensation prior to data communication (shaded slots). For UL data communication, the UL slot spans half of the coherence interval, i.e., $N_c/2$ channel uses. The channel estimates acquired in a UL slot are used for downlink beamforming of information to the UTs in the next DL slot.

In a massive MIMO BS serving several UTs simultaneously, acquisition and compensation of carrier frequency offsets from different UTs is important. Therefore we propose the following communication strategy, as illustrated in Fig. 1. At the beginning of communication, we propose to first estimate the CFOs from different UTs at the BS, using special pilots transmitted by the UTs in a special UL slot. In the following DL slot, the BS feeds the CFO estimates back to the UTs over a control channel$^{1}$ and the UTs correct their internal oscillators accordingly (see Fig. 1). From the next UL slot, normal communication starts, and the UTs transmit pilot signals for channel estimation, followed by uplink data transmission. The special UL/DL slots for CFO estimation might be repeated after several coherence intervals, depending on how fast the CFO changes.

The massive MIMO BS under consideration is assumed to operate in a frequency-selective channel environment. Therefore, the complex baseband discrete-time channel impulse response has $L > 1$ taps. The channel gain coefficient between the $k$-th UT and the $m$-th BS antenna at the $l$-th tap is given by $h_{km}[l] = \sigma_{hkl}\, g_{km}[l]$ ($\sigma_{hkl} > 0$), where $l = 0, 1, \ldots, L-1$, $k = 1, 2, \ldots, K$ and $m = 1, 2, \ldots, M$. Here $g_{km}[l]$ models the fast fading component of the baseband channel gain; these components are assumed to be independent and identically distributed (i.i.d.) and Rayleigh faded, i.e., circular symmetric complex Gaussian with unit variance, or $g_{km}[l] \sim \mathcal{CN}(0, 1)$. Also, $\sigma_{hkl}^2$, $\forall (k, l)$, models the power delay profile (PDP) of the channel. For communication purposes, we assume that the PDP is fixed for the entire duration of communication and is known to the BS. Also, the channel realization remains unchanged over $N_c$ channel uses.$^{2}$

$^{1}$We consider this feedback to be error free. The exact mechanism for this feedback and the impact of feedback errors is a topic of future study.


A. Signal Model 

Let $a_k[t]$ denote the transmitted signal from the $k$-th UT at time $t$. The signal received at the $m$-th BS antenna is therefore given by

$$ r_m[t] \triangleq \sum_{k=1}^{K} \sum_{l=0}^{L-1} h_{km}[l]\, a_k[t-l]\, e^{j\omega_k t} + n_m[t], \tag{1} $$

where $n_m[t] \sim \mathcal{CN}(0, \sigma^2)$ is the circular symmetric additive white Gaussian noise (AWGN) with variance $\sigma^2$, and $\omega_k \triangleq 2\pi \Delta f_k T_s$, where $\Delta f_k$ is the frequency offset at the BS for the $k$-th UT and $T_s = 1/B_w$. Here $B_w$ is the total communication bandwidth.


III. CFO Estimation 

In this section, we first present the Cramer-Rao Lower Bound (CRLB) on the accuracy of the best possible unbiased CFO estimate (see Section III-A). The CRLB also gives us insight into the optimal pilot signal design. Various algorithms have been devised for CFO estimation in single and multi-carrier MIMO systems [4]-[8]. However, because of the prohibitive increase in their computational complexity with increasing number of BS antennas and UTs, these algorithms are not suitable for massive MIMO systems. We propose a simple orthogonal pilot sequence, which satisfies the optimality criterion for pilot design (see Section III-B). Using these pilots we develop a low-complexity algorithm to estimate the CFO of each user separately (Section III-C).


A. CRLB for CFO Estimation 

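Let $(a_k[0], a_k[1], \ldots, a_k[N-1])$ be the sequence of $N$ pilot symbols transmitted by the $k$-th UT in the UL slot for CFO estimation (see Fig. 1).$^{3}$ The signal received at the $m$-th BS antenna over the $N$ channel uses is denoted by $\mathbf{r}_m \triangleq (r_m[0], r_m[1], \ldots, r_m[N-1])^T$. Using (1), with $t = 0, 1, \ldots, N-1$ (here $t$ represents the $t$-th channel use), $\mathbf{r}_m$ is given by$^{4}$

$$ \mathbf{r}_m = \sum_{k=1}^{K} \boldsymbol{\Gamma}(\omega_k)\, \mathbf{A}_k\, \mathbf{h}_{km} + \mathbf{n}_m. \tag{2} $$

Here, $\boldsymbol{\Gamma}(\omega_k) \triangleq \mathrm{diag}\left(1, e^{j\omega_k}, \ldots, e^{j(N-1)\omega_k}\right)$, $\mathbf{n}_m \triangleq (n_m[0], n_m[1], \ldots, n_m[N-1])^T$, and $\mathbf{h}_{km} \triangleq (h_{km}[0], h_{km}[1], \ldots, h_{km}[L-1])^T$. Also, $\mathbf{A}_k \in \mathbb{C}^{N \times L}$ is a circulant matrix whose $(t, q)$-th entry is given by

$$ A_k(t, q) \triangleq \begin{cases} a_k[t-q], & t \ge q \\ a_k[t-q+N], & t < q \end{cases} \tag{3} $$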


$^{2}$Here $N_c$ is the number of channel uses that span the coherence interval. If $T_c$ is the coherence time and $B_w$ is the communication bandwidth, then $N_c = T_c B_w$.

$^{3}$We assume a collocated BS model where all the BS antennas use the same oscillator.

$^{4}$A copy of the last $L-1$ pilot symbols, i.e., $\{a_k[N-L+1], \ldots, a_k[N-1]\}$, is transmitted before $\{a_k[0], \ldots, a_k[N-1]\}$.

Fig. 2 Illustration of the proposed transmitted pilot/training sequence for CFO estimation with $K = 2$, $L = 2$, $N = 12$ (i.e., $B = N/KL = 3$).


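where $t = 1, 2, \ldots, N$, and $q = 1, 2, \ldots, L$. The overall received signal vector, $\mathbf{r} \triangleq (\mathbf{r}_1^T, \mathbf{r}_2^T, \ldots, \mathbf{r}_M^T)^T$, is thus given by

$$ \mathbf{r} = \mathbf{Q}(\boldsymbol{\omega})\, \mathbf{h} + \mathbf{n}, \tag{4} $$

where $\mathbf{h} \triangleq (\mathbf{h}_1^T, \mathbf{h}_2^T, \ldots, \mathbf{h}_M^T)^T$ and $\mathbf{h}_m \triangleq (\mathbf{h}_{1m}^T, \mathbf{h}_{2m}^T, \ldots, \mathbf{h}_{Km}^T)^T$. Further, $\mathbf{n} \triangleq (\mathbf{n}_1^T, \mathbf{n}_2^T, \ldots, \mathbf{n}_M^T)^T$ and $\boldsymbol{\omega} \triangleq (\omega_1, \omega_2, \ldots, \omega_K)^T$. Also, $\mathbf{Q}(\boldsymbol{\omega}) \triangleq \mathbf{I}_M \otimes [\boldsymbol{\Gamma}(\omega_1)\mathbf{A}_1, \boldsymbol{\Gamma}(\omega_2)\mathbf{A}_2, \ldots, \boldsymbol{\Gamma}(\omega_K)\mathbf{A}_K]$. Here '$\otimes$' denotes the Kronecker product for matrices.

Based on the above data model, the CRLB for CFO estimation is given by [4], [5]:

$$ \mathrm{CRLB}(\boldsymbol{\omega}) = \frac{\sigma^2}{2} \left( \Re\left\{ \mathbf{V}^H \mathbf{P}_Q^{\perp} \mathbf{V} \right\} \right)^{-1}, \tag{5} $$

where $\mathbf{P}_Q^{\perp} \triangleq \mathbf{I}_{MN} - \mathbf{Q}(\mathbf{Q}^H\mathbf{Q})^{-1}\mathbf{Q}^H$. In (5), the submatrix of $\mathbf{V}$ defined by rows $[(m-1)N+1, \ldots, mN]$ in the $k$-th column is given by $\mathbf{V}((m-1)N+1 : mN,\, k) = \mathbf{F}\, \boldsymbol{\Gamma}(\omega_k)\, \mathbf{A}_k\, \mathbf{h}_{km} \in \mathbb{C}^{N \times 1}$, where $\mathbf{F} \triangleq \mathrm{diag}(0, 1, \ldots, N-1)$, $m = 1, 2, \ldots, M$, and $k = 1, 2, \ldots, K$.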


B. Proposed Training Sequence for CFO Estimation 

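From [4], [5], for large $N$, the conditions for optimality of the training sequence (i.e., in terms of minimizing the CRLB) are given by

$$ \mathbf{A}_i^H \mathbf{A}_i \propto \mathbf{I}_L, \qquad \mathbf{A}_i^H \mathbf{A}_j = \mathbf{0}_{L \times L}, \quad \forall\, i \ne j,\ (i, j) \in \{1, 2, \ldots, K\}. \tag{6} $$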


In this paper, we propose pilot signals which consist of pilot-blocks. Each pilot-block spans $KL$ channel uses and there are therefore $B \triangleq N/KL$ pilot-blocks. In each pilot-block, each UT transmits only a single impulse, which is preceded and followed by zeros. In the $b$-th pilot-block, the first UT transmits an impulse at $t = (b-1)KL$, the second UT at $t = (b-1)KL + L$, and the $K$-th UT at $t = (b-1)KL + (K-1)L$. Since the impulses from different UTs are separated by $L$ channel uses (i.e., the maximum delay spread of any user's impulse response), the signal received at the BS antennas in the time interval $(b-1)KL + (k-1)L \le t \le (b-1)KL + (kL-1)$ will simply be the impulse response of the channel for the $k$-th UT. These $KL$ channel uses ($L$ channel uses for each of the $K$ users) constitute a pilot-block (see Fig. 2). Since the channel realization remains static over an entire coherence interval, the CFO of a user can be estimated by correlating the impulse responses received from that user in consecutive pilot-blocks. The overall transmitted pilot for the $k$-th user is given by

$$ a_k[t] = \begin{cases} \sqrt{KL\,p_u}, & t \bmod KL = (k-1)L \\ 0, & \text{elsewhere,} \end{cases} \tag{7} $$

where $p_u$ is the average power of the pilot signal transmitted by each UT.$^{5}$ An example of the proposed pilot sequence is illustrated in Fig. 2.

It can be shown that for the proposed training sequence in (7), $\mathbf{A}_i^H \mathbf{A}_j = \mathbf{0}_{L \times L}$ when $i \ne j$, and $\mathbf{A}_i^H \mathbf{A}_i \propto \mathbf{I}_L$ (specifically $N p_u \mathbf{I}_L$, since each column of $\mathbf{A}_i$ contains $B$ impulses of energy $KLp_u$ each and $BKL = N$), for $i = 1, 2, \ldots, K$ and $j = 1, 2, \ldots, K$, i.e., the proposed training sequence satisfies the optimality conditions (see (6)).
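The orthogonality claim above can be checked numerically. The following is a minimal sketch, not part of the original derivation: the helper names `pilot_sequence`, `circulant` and `gram` are hypothetical, indices are 0-based, and the parameters are those of Fig. 2 with an assumed $p_u = 1$.

```python
import math

def pilot_sequence(k, K, L, N, p_u=1.0):
    """Pilot of user k (1-indexed) as in (7): amplitude sqrt(K*L*p_u)
    whenever t mod K*L == (k-1)*L, and zero elsewhere."""
    amp = math.sqrt(K * L * p_u)
    return [amp if t % (K * L) == (k - 1) * L else 0.0 for t in range(N)]

def circulant(a, L):
    """N x L circulant matrix of (3) with 0-based indices:
    entry (t, q) equals a[(t - q) mod N]."""
    N = len(a)
    return [[a[(t - q) % N] for q in range(L)] for t in range(N)]

def gram(A_i, A_j):
    """A_i^T A_j; the pilots are real, so transpose equals Hermitian."""
    L = len(A_i[0])
    return [[sum(ri[p] * rj[q] for ri, rj in zip(A_i, A_j))
             for q in range(L)] for p in range(L)]

# Setting of Fig. 2: K = 2, L = 2, N = 12, hence B = 3 pilot-blocks.
K, L, N, p_u = 2, 2, 12, 1.0
A = [circulant(pilot_sequence(k, K, L, N, p_u), L) for k in (1, 2)]
G11 = gram(A[0], A[0])  # same user: a scaled identity matrix
G12 = gram(A[0], A[1])  # different users: the all-zero matrix
```

In this example `G11` is $N p_u \mathbf{I}_L$ and `G12` is the zero matrix, which is exactly the pair of conditions in (6).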

C. Proposed CFO Estimate for the $k$-th UT

For the proposed pilot signal given in (7), the received signal at the $m$-th BS antenna at time instance $\tau(b, k, l) \triangleq (b-1)KL + (k-1)L + l$ is given by

$$ r_m[\tau(b,k,l)] = \sqrt{KL\,p_u}\; h_{km}[l]\, e^{j\omega_k \tau(b,k,l)} + n_m[\tau(b,k,l)], $$

where $l = 0, 1, \ldots, L-1$, $k = 1, 2, \ldots, K$ and $b = 1, 2, \ldots, B$. The correlation of the received pilots in consecutive blocks, i.e., at time instances $\tau(b,k,l)$ and $\tau(b+1,k,l)$, is given by (8). Note that the argument of the first term in (8) is $\omega_k KL$, i.e., it depends on the CFO of the $k$-th user, $\omega_k$. Since the argument of this term is independent of the BS antenna index $m$, the channel tap $l$ and the pilot-block index $b$, we can average the noise terms in (8) (denoted as $c_k(b,m,l)$) over $b$, $m$ and $l$. This averaging is given by

$$ \rho_k \triangleq \frac{\sum_{b=1}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} r_m^*[\tau(b,k,l)]\; r_m[\tau(b+1,k,l)]}{MKL(B-1)\,p_u \sum_{l=0}^{L-1}\sigma_{hkl}^2} = G_k\, e^{j\omega_k KL} + \nu_k. \tag{9} $$

Here $\nu_k$ and $G_k$ are given by

$$ \nu_k \triangleq \frac{\sum_{b=1}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} c_k(b,m,l)}{MKL(B-1)\,p_u \sum_{l=0}^{L-1}\sigma_{hkl}^2}, \tag{10} $$

$$ G_k \triangleq \frac{\sum_{m=1}^{M}\sum_{l=0}^{L-1} |h_{km}[l]|^2}{M \sum_{l=0}^{L-1}\sigma_{hkl}^2}. \tag{11} $$

From (9) we note that the argument of the first term depends on the CFO of the $k$-th UT and its magnitude, $G_k$, is independent of the CFO. Therefore, we propose the following estimate of the CFO for the $k$-th user

$$ \hat{\omega}_k \triangleq \frac{\arg(\rho_k)}{KL}, \tag{12} $$

where $\arg(c)$ denotes the 'principal argument' of the complex number $c$.
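To make the estimator in (9) and (12) concrete, the following is a minimal simulation sketch under stated assumptions: a uniform PDP ($\sigma_{hkl}^2 = 1/L$), i.i.d. $\mathcal{CN}(0,1)$ fast fading, and received samples generated directly from the per-tap signal model at $\tau(b,k,l)$. The function name `estimate_cfo` and its parameters are hypothetical, chosen only for illustration.

```python
import cmath
import math
import random

def estimate_cfo(M, K, L, B, omega, p_u=1.0, noise_var=0.0, seed=0):
    """Simulate the special UL slot and return [omega_hat_1, ..., omega_hat_K].
    Assumes a uniform PDP (sigma_hkl^2 = 1/L) and i.i.d. Rayleigh fading."""
    rng = random.Random(seed)

    def cn(var):
        # Circularly symmetric complex Gaussian sample with variance `var`.
        s = math.sqrt(var / 2.0)
        return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))

    amp = math.sqrt(K * L * p_u)  # impulse amplitude from (7)
    estimates = []
    for k in range(1, K + 1):
        acc = 0j  # numerator of (9)
        for m in range(M):
            for l in range(L):
                h = cn(1.0 / L)  # channel tap h_km[l], static over the slot
                r = []
                for b in range(1, B + 1):
                    tau = (b - 1) * K * L + (k - 1) * L + l
                    r.append(amp * h * cmath.exp(1j * omega[k - 1] * tau)
                             + cn(noise_var))
                # Correlate the received pilots of consecutive pilot-blocks.
                acc += sum(r[b].conjugate() * r[b + 1] for b in range(B - 1))
        # The denominator of (9) is real and positive, so it does not
        # change arg(rho_k); the estimate follows (12).
        estimates.append(cmath.phase(acc) / (K * L))
    return estimates

omega_true = [math.pi / 2500, -math.pi / 5000]
omega_hat = estimate_cfo(M=16, K=2, L=2, B=10, omega=omega_true,
                         noise_var=0.1, seed=1)
```

Each user costs one complex multiply-accumulate per $(b, m, l)$ triple plus a single `phase` call, which is the operation count discussed in Remark 3.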


$^{5}$Use of the scaling factor $\sqrt{KL}$ guarantees that the average power transmitted from each user is $p_u$, irrespective of the number of users and the number of channel taps.













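$$ r_m^*[\tau(b,k,l)]\; r_m[\tau(b+1,k,l)] = KL\,p_u\,|h_{km}[l]|^2\, e^{j\omega_k KL} + \underbrace{T_1 + T_2 + T_3}_{\text{Sum of noise terms, } c_k(b,m,l)\, \triangleq\, T_1 + T_2 + T_3}, \tag{8} $$

where

$$ T_1 \triangleq \sqrt{KL\,p_u}\; h_{km}^*[l]\, e^{-j\omega_k \tau(b,k,l)}\, n_m[\tau(b+1,k,l)], \quad T_2 \triangleq \sqrt{KL\,p_u}\; h_{km}[l]\, e^{j\omega_k \tau(b+1,k,l)}\, n_m^*[\tau(b,k,l)], \quad T_3 \triangleq n_m[\tau(b+1,k,l)]\; n_m^*[\tau(b,k,l)]. $$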


Remark 1: Note that the proposed CFO estimate in (12) is well-defined if and only if $|\omega_k KL| < \pi$. For most practical systems, we believe that this condition will hold true.$^{6}$ □

Remark 2: With a carrier frequency $f_c$, let the maximum CFO for any user be $\Delta f = \kappa f_c$ (note that for mobile terminals, $\kappa$ might depend on the velocity of the terminal). Since $|\omega_k KL|$ must be less than $\pi$ and $\omega_k = 2\pi \Delta f T_s$, we must have

$$ \pi > |2\pi \Delta f\, KL\, T_s| \overset{(a)}{=} 2\pi \kappa K f_c T_d, \tag{13} $$

where (a) follows from the fact that $T_d = L T_s$ is the delay spread of the channel. Therefore from (13) it follows that, for the proposed CFO estimator in (12) to work, the maximum number of allowed users must be less than $\frac{1}{2\kappa f_c T_d}$. Note that this maximum limit is usually quite large. For example, with $f_c = 2$ GHz, $T_d = 5\,\mu$s and $\kappa = 0.1$ PPM, we have $K < 500$. □
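The arithmetic behind Remarks 1 and 2 can be reproduced in a few lines. This is only an illustrative sketch; the variable names are ours, and the numerical values are the ones quoted in Remark 2 and in the accompanying footnote.

```python
import math

f_c = 2e9        # carrier frequency: 2 GHz
kappa = 0.1e-6   # oscillator accuracy: 0.1 PPM
B_w = 1e6        # communication bandwidth: 1 MHz
T_d = 5e-6       # delay spread: 5 microseconds

T_s = 1.0 / B_w                             # sampling interval
delta_f = kappa * f_c                       # maximum CFO in Hz
omega_max = 2 * math.pi * delta_f * T_s     # maximum CFO in radian/sample
L = round(T_d / T_s)                        # number of channel taps
K_limit = 1.0 / (2 * kappa * f_c * T_d)     # user limit implied by (13)

# Phase accumulated between consecutive pilot-blocks for K = 10 users;
# this must stay below pi for (12) to be well-defined.
phase_step = omega_max * 10 * L
```

With these values `delta_f` is 200 Hz, `omega_max` is $\pi/2500$, `K_limit` is 500, and `phase_step` is $\pi/50$, far below the ambiguity limit of $\pi$.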


Remark 3: From (9) and (12) it is clear that the total number of operations required to compute all the $K$ CFO estimates is $(M(B-1)L + 1)K$. Since $N = BKL$, the average number of operations per channel use is $(M(B-1)L + 1)K/N \approx M$ ($N \gg K$). Note that the total per-channel use complexity is independent of $K$ and increases only linearly with $M$. □
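As a quick illustration of the count in Remark 3, the sketch below evaluates the per-channel-use operation count for the parameters used later in Section IV. The function name `ops_per_channel_use` is hypothetical. Expanding the expression with $N = BKL$ gives $M(1 - KL/N) + K/N$, so the count is close to $M$ only when $KL \ll N$; this identity is our own rearrangement, not a statement from the original text.

```python
def ops_per_channel_use(M, K, L, N):
    """Average number of operations per channel use from Remark 3,
    assuming N is an integer multiple of K*L."""
    B = N // (K * L)                   # number of pilot-blocks
    total = (M * (B - 1) * L + 1) * K  # operations for all K estimates
    return total / N

ops_a = ops_per_channel_use(M=160, K=10, L=5, N=500)
ops_b = ops_per_channel_use(M=320, K=10, L=5, N=500)
ops_c = ops_per_channel_use(M=160, K=5, L=5, N=500)
```

Doubling $M$ roughly doubles the count (`ops_b` versus `ops_a`), while halving $K$ changes it only slightly (`ops_c` versus `ops_a`).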
Remark 4: From (11) and the strong law of large numbers, it follows that for i.i.d. $\{h_{km}[l]\}$, $G_k \to 1$ as $M \to \infty$, with probability 1. □

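Lemma 1. The mean and variance of $\nu_k^I \triangleq \Re(\nu_k)$ and $\nu_k^Q \triangleq \Im(\nu_k)$ are given by $\mathbb{E}[\nu_k^I] = \mathbb{E}[\nu_k^Q] = 0$ and

$$ \mathbb{E}\left[(\nu_k^I)^2\right] = \frac{\frac{G_k}{\gamma_k}\left(1 + \frac{B-2}{B-1}\cos(2\omega_k KL)\right) + \frac{1}{2K\gamma_k^2}}{MKL(B-1)}, \tag{14} $$

$$ \mathbb{E}\left[(\nu_k^Q)^2\right] = \frac{\frac{G_k}{\gamma_k}\left(1 - \frac{B-2}{B-1}\cos(2\omega_k KL)\right) + \frac{1}{2K\gamma_k^2}}{MKL(B-1)}. \tag{15} $$

Here $\gamma_k \triangleq \frac{p_u}{\sigma^2} \sum_{l=0}^{L-1} \sigma_{hkl}^2$.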

Proof: From (10) we have

$$ \nu_k^I = \frac{\Re(X_k)}{MKL(B-1)\,p_u \sum_{l=0}^{L-1}\sigma_{hkl}^2}, \qquad \nu_k^Q = \frac{\Im(X_k)}{MKL(B-1)\,p_u \sum_{l=0}^{L-1}\sigma_{hkl}^2}, \tag{16} $$

where $X_k \triangleq \sum_{b=1}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} c_k(b,m,l)$ and $c_k(b,m,l)$ is defined in (8). Since $n_m[t]$ are all complex circular symmetric, i.i.d. Gaussian, we have $\mathbb{E}[T_i] = 0$, $i = 1, 2, 3$ ($T_i$ are defined in (8)) and therefore $\mathbb{E}[c_k(b,m,l)] = 0$, $\forall\, b = 1, 2, \ldots, B-1$, $l = 0, 1, \ldots, L-1$ and $m = 1, 2, \ldots, M$. Clearly, $\mathbb{E}[X_k] = \sum_{b=1}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} \mathbb{E}[c_k(b,m,l)] = 0$. Therefore from (16), we get $\mathbb{E}[\nu_k^I] = \mathbb{E}[\nu_k^Q] = 0$.

The real components of $T_i$, $i = 1, 2, 3$, are given by

$$ \begin{aligned} \Re(T_1) &= \sqrt{KL\,p_u}\; |h_{km}[l]|\; |n_m[\tau(b+1,k,l)]|\; \cos\big(\angle n_m[\tau(b+1,k,l)] - \angle h_{km}[l] - \omega_k \tau(b,k,l)\big), \\ \Re(T_2) &= \sqrt{KL\,p_u}\; |h_{km}[l]|\; |n_m[\tau(b,k,l)]|\; \cos\big(-\angle n_m[\tau(b,k,l)] + \angle h_{km}[l] + \omega_k \tau(b+1,k,l)\big), \\ \Re(T_3) &= |n_m[\tau(b,k,l)]|\; |n_m[\tau(b+1,k,l)]|\; \cos\big(\angle n_m[\tau(b+1,k,l)] - \angle n_m[\tau(b,k,l)]\big), \end{aligned} \tag{17} $$

where $\angle c$ represents the 'principal argument' of $c \in \mathbb{C}$. Using the expressions in (17), the variance of $\Re(X_k)$ is given by (18). Using the expression for $\mathbb{E}[(\Re(X_k))^2]$ from (18) and the expression for $\nu_k^I$ from (16), the variance of $\nu_k^I$ is given by (14). Similarly, the variance of $\nu_k^Q$ can be shown to be given by (15). ■

$^{6}$For instance, with a carrier frequency of $f_c = 2$ GHz and an oscillator accuracy of 0.1 PPM (parts per million) (commonly used in cellular BSs [10]), the maximum carrier frequency offset is $\Delta f = f_c \times 10^{-7} = 200$ Hz. For a system having communication bandwidth $B_w = 1$ MHz (i.e., $T_s = 10^{-6}$ s) this corresponds to a CFO of $\omega_k = 2\pi \Delta f T_s = 2\pi \times 200 \times 10^{-6} = 4\pi \times 10^{-4}$ radian, which is 2500 times less than $\pi$. Therefore for massive MIMO systems, even with $K = 10$ and $L = 5$, $|\omega_k KL| = \pi/50 \ll \pi$.
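The variance expressions (14) and (15) of Lemma 1 are straightforward to evaluate numerically. The sketch below is illustrative only: the function name `nu_variances` and the default $G_k = 1$ (motivated by Remark 4) are our assumptions.

```python
import math

def nu_variances(M, K, L, B, gamma, G=1.0, omega=0.0):
    """Return (E[(nu_I)^2], E[(nu_Q)^2]) according to (14) and (15)."""
    c = (B - 2) / (B - 1) * math.cos(2 * omega * K * L)
    denom = M * K * L * (B - 1)
    tail = 1.0 / (2 * K * gamma ** 2)   # noise-times-noise contribution
    var_I = (G / gamma * (1 + c) + tail) / denom
    var_Q = (G / gamma * (1 - c) + tail) / denom
    return var_I, var_Q

# Example: M = 160, K = 10, L = 5, B = 10 (i.e., N = 500), SNR gamma = 0.2.
var_I, var_Q = nu_variances(M=160, K=10, L=5, B=10, gamma=0.2,
                            omega=math.pi / 2500)
```

For small $|\omega_k KL|$ the in-phase variance exceeds the quadrature variance, because the cross-correlation between the noise terms of adjacent pilot-blocks adds in (14) and subtracts in (15). Both variances scale as $1/M$.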

Remark 5: From (14) and (15) it is clear that $\mathbb{E}[(\nu_k^I)^2]$ and $\mathbb{E}[(\nu_k^Q)^2]$ decrease with increasing number of BS antennas, $M$. This reduction in the variance of $\nu_k^I$ and $\nu_k^Q$ with increasing $M$ is due to the averaging of the noise terms (defined in (8)) across all $M$ BS antennas (see (10)). Further, as $M \to \infty$, since $\nu_k^I$ and $\nu_k^Q$ are averages of a large number of noise terms (see (10)), it is expected that both $\nu_k^I$ and $\nu_k^Q$ would be asymptotically Gaussian distributed. □


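Theorem 1. (Approximation of the CFO Estimate) If $|\omega_k KL| \ll \pi$ and $\gamma_k \gg \gamma_k^0$, then the proposed CFO estimate in (12) can be approximated by

$$ \hat{\omega}_k \approx \omega_k + \frac{\nu_k^Q}{G_k KL}, \tag{19} $$

where

$$ \gamma_k^0 \triangleq \frac{\frac{B-1}{2B-3}}{K G_k \left[\sqrt{1 + 2ML\,\frac{(B-1)^3}{(2B-3)^2}} - 1\right]}. $$

Proof: See Appendix. ■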
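For a feel of how small the threshold $\gamma_k^0$ in Theorem 1 is in the massive MIMO regime, the following sketch evaluates it for $M = 40$, $K = 5$, $L = 2$, $N = 100$. The function name `gamma_threshold` is hypothetical and $G_k = 1$ is assumed, in line with Remark 4.

```python
import math

def gamma_threshold(M, K, L, N, G=1.0):
    """Threshold SNR gamma_k^0 of Theorem 1, assuming N = B*K*L."""
    B = N // (K * L)
    root = math.sqrt(1 + 2 * M * L * (B - 1) ** 3 / (2 * B - 3) ** 2)
    return ((B - 1) / (2 * B - 3)) / (K * G * (root - 1))

g0 = gamma_threshold(M=40, K=5, L=2, N=100)
g0_dB = 10 * math.log10(g0)
```

This gives a threshold of roughly 0.0055, i.e., about $-22.5$ dB, so the approximation in (19) applies over a wide range of practical SNR values.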

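$$ \begin{aligned} \mathbb{E}\big[(\Re(X_k))^2\big] &= \mathbb{E}\Bigg[\bigg(\sum_{b=1}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} \Re(T_1 + T_2 + T_3)\bigg)^2\Bigg] \\ &= \sum_{b=1}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} \Big\{ KL\,p_u |h_{km}[l]|^2 \sigma^2\, \mathbb{E}\big[\cos^2\big(\omega_k \tau(b+1,k,l) + \angle h_{km}[l] - \angle n_m[\tau(b,k,l)]\big)\big] \\ &\qquad\qquad + KL\,p_u |h_{km}[l]|^2 \sigma^2\, \mathbb{E}\big[\cos^2\big(\angle n_m[\tau(b+1,k,l)] - \omega_k \tau(b,k,l) - \angle h_{km}[l]\big)\big] \\ &\qquad\qquad + \sigma^4\, \mathbb{E}\big[\cos^2\big(\angle n_m[\tau(b+1,k,l)] - \angle n_m[\tau(b,k,l)]\big)\big] \Big\} \\ &\quad + 2\sum_{b=2}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} KL\,p_u |h_{km}[l]|^2\, \mathbb{E}\Big[ |n_m[\tau(b,k,l)]|^2 \cos\big(\angle n_m[\tau(b,k,l)] - \angle h_{km}[l] - \omega_k \tau(b-1,k,l)\big) \\ &\qquad\qquad \times \cos\big(\omega_k \tau(b+1,k,l) + \angle h_{km}[l] - \angle n_m[\tau(b,k,l)]\big) \Big] \\ &= \sum_{b=1}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} \Big\{ KL\,p_u |h_{km}[l]|^2 \sigma^2 + \frac{\sigma^4}{2} \Big\} + \sum_{b=2}^{B-1}\sum_{m=1}^{M}\sum_{l=0}^{L-1} KL\,p_u |h_{km}[l]|^2 \sigma^2 \cos(2\omega_k KL). \end{aligned} \tag{18} $$

Corollary to Theorem 1: If $|\omega_k KL| \ll \pi$ and $\gamma_k \gg \gamma_k^0$, then the mean square error (MSE) of the proposed CFO estimate for the $k$-th UT is given by

$$ \mathbb{E}\big[(\hat{\omega}_k - \omega_k)^2\big] \approx \frac{\frac{1}{\gamma_k}\left(\frac{G_k}{B-1} + \frac{1}{2K\gamma_k}\right)}{M(N-KL)(KL)^2 G_k^2}. \tag{20} $$

Fig. 3 Plot of simulated and theoretical MSE with varying SNR, compared to the CRLB, depicting the improvement in the MSE in the massive MIMO regime ($M = 40$, $K = 5$) with respect to the non-massive MIMO regime ($M = 2$, $K = 2$). Fixed $N = 100$ and $L = 2$.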


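Proof: For $|\omega_k KL| \ll \pi$ and $\gamma_k \gg \gamma_k^0$, from (19), the expression for the CFO estimation error for the $k$-th UT is given by

$$ \Delta\omega_k \triangleq \hat{\omega}_k - \omega_k \approx \frac{\nu_k^Q}{G_k KL}. \tag{21} $$

Clearly, $\mathbb{E}[\Delta\omega_k] \approx 0$. The variance of $\Delta\omega_k$, i.e., the mean square error (MSE), is then given by

$$ \mathbb{E}\big[(\Delta\omega_k)^2\big] \approx \frac{\mathbb{E}\big[(\nu_k^Q)^2\big]}{G_k^2 (KL)^2}. \tag{22} $$

Further, since $|\omega_k KL| \ll \pi$, we have $\cos(2\omega_k KL) \approx 1$. Using (15) (with the approximation $\cos(2\omega_k KL) \approx 1$) in (22), we get (20). ■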

Remark 6: From (20), it is clear that the MSE diminishes with increasing received SNR, $\gamma_k$. We also illustrate this through Fig. 3, where we plot the variation in the MSE of the CFO for the first user, i.e., $\mathbb{E}[(\hat{\omega}_1 - \omega_1)^2]$ averaged over the channel statistics, as a function of increasing SNR, $\gamma_1 = \frac{p_u}{\sigma^2}\sum_{l=0}^{L-1}\sigma_{h1l}^2$. The PDP of the channel is $\sigma_{hkl}^2 = 1/L$, $l = 0, 1, \ldots, L-1$, and $k = 1, 2, \ldots, K$. We also plot the CRLB (given by (5)). We have $N = 100$ and $L = 2$. We plot both the theoretical MSE (see (20)) and the simulated MSE. With $M = 40$ and $K = 5$ (i.e., massive MIMO regime), it is observed that the proposed estimator is near optimal at high SNR (i.e., the values of SNR for which MSE $\ll \omega_k^2$). However, with $M = 2$ and $K = 2$ (i.e., non-massive MIMO regime), we note that the proposed estimator does not perform as well as the CRLB. The above observations therefore motivate the use of the proposed estimator for massive MIMO systems. This is even more so because the well-known near-optimal estimators used in conventional small MIMO systems have prohibitive complexity and cannot be used in massive MIMO systems. On the contrary, the proposed estimator has low complexity and is therefore well-suited for the massive MIMO regime.

Also, for $M = 40$ and $K = 5$, since $G_k \approx 1$ (see Remark 4), we have $\gamma_k^0 \approx 0.0055$ (i.e., $-22.5$ dB). As can be seen in Fig. 3, for all $\gamma_k \gg \gamma_k^0 = -22.5$ dB, the simulated MSE (curve marked with triangles) and the theoretical MSE (curve marked with squares) coincide, i.e., the MSE approximation in (20) is tight. □

Using the Corollary to Theorem 1, we present the following 
propositions. 

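Proposition 1: If $|\omega_k KL| \ll \pi$ and $\gamma_k \gg \gamma_k^0$, then for a desired MSE, $\epsilon > 0$, and fixed $M$, $K$, $L$ and $N$, the required received SNR, $\gamma_k$, is given by$^{7}$

$$ \gamma_k(\epsilon) \triangleq \frac{G_k/(B-1)}{2\epsilon d}\left[1 + \sqrt{1 + \frac{2(B-1)^2 \epsilon d}{K G_k^2}}\,\right], \tag{23} $$

where $d \triangleq M(N-KL)(KL)^2 G_k^2$.

Proof: Since $|\omega_k KL| \ll \pi$ and $\gamma_k \gg \gamma_k^0$, then from (20), for a desired MSE $\epsilon > 0$, we get

$$ \epsilon = \frac{\frac{1}{\gamma_k}\left(\frac{G_k}{B-1} + \frac{1}{2K\gamma_k}\right)}{M(N-KL)(KL)^2 G_k^2}, \quad \text{or,} \quad \epsilon d\, \gamma_k^2 - \frac{G_k}{B-1}\,\gamma_k - \frac{1}{2K} = 0, \tag{24} $$

where $d = M(N-KL)(KL)^2 G_k^2$ and $N = BKL$. Solving (24) for the required $\gamma_k$, we obtain (23). ■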
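The relation between (20) and (23) can be verified with a round-trip computation: plugging the required SNR from (23) back into the MSE expression (20) should return the desired MSE. The sketch below is illustrative; the function names are hypothetical, $G_k = 1$ is assumed, and $\epsilon = 10^{-8}$ is used purely as an example value.

```python
import math

def mse_theory(gamma, M, K, L, N, G=1.0):
    """Theoretical MSE from (20), assuming N = B*K*L."""
    B = N // (K * L)
    d = M * (N - K * L) * (K * L) ** 2 * G ** 2
    return (G / (B - 1) + 1.0 / (2 * K * gamma)) / (gamma * d)

def required_snr(eps, M, K, L, N, G=1.0):
    """Required received SNR from (23): the positive root of (24)."""
    B = N // (K * L)
    d = M * (N - K * L) * (K * L) ** 2 * G ** 2
    x = 2 * (B - 1) ** 2 * eps * d / (K * G ** 2)
    return (G / (B - 1)) / (2 * eps * d) * (1 + math.sqrt(1 + x))

eps = 1e-8
gamma_req = required_snr(eps, M=160, K=10, L=5, N=500)
mse_back = mse_theory(gamma_req, M=160, K=10, L=5, N=500)
```

Here `gamma_req` is about 0.2 (roughly $-7$ dB) and `mse_back` recovers `eps` up to floating-point error, confirming that (23) is the positive root of the quadratic in (24).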


$^{7}$For ease of notation, we subsequently treat the R.H.S. of (20) as the exact value of the MSE, since the approximation in (20) is tight when $|\omega_k KL| \ll \pi$ and $\gamma_k \gg \gamma_k^0$.



































Remark 7: From (23) it follows that for a fixed $M$, $N$, $K$ and $L$, the required $\gamma_k(\epsilon)$ increases with decreasing desired MSE, $\epsilon$. Further, it can be shown that to achieve a desired MSE $\epsilon \ll \omega_k^2$, the required $\gamma_k(\epsilon)$ is much greater than $\gamma_k^0$, i.e., $\gamma_k(\epsilon) \gg \gamma_k^0$. □

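Proposition 2: (Impact of the Number of BS Antennas) Consider $|\omega_k KL| \ll \pi$ and $\gamma_k \gg \gamma_k^0$. For a fixed $K$, $L$, $N$ and $M$ sufficiently large, i.e.,

$$ M \gg \frac{K}{2\epsilon (N-KL)^3}, \tag{25} $$

the required SNR to achieve a fixed desired MSE $\epsilon \ll \omega_k^2$ is inversely proportional to $\sqrt{M}$, i.e., $\gamma_k(\epsilon) \propto 1/\sqrt{M}$.$^{8}$

Proof: From (25), we have

$$ \frac{2(B-1)^2 \epsilon d}{K G_k^2} = \frac{2\epsilon (N-KL)^3 M}{K} \gg 1, \tag{26} $$

where $d = M(N-KL)(KL)^2 G_k^2$ and $N = BKL$. From (26) it follows that

$$ 1 + \sqrt{1 + \frac{2(B-1)^2 \epsilon d}{K G_k^2}} \approx \sqrt{\frac{2(B-1)^2 \epsilon d}{K G_k^2}}. \tag{27} $$

Using the approximation from (27) in (23), we get

$$ \gamma_k(\epsilon) \approx \frac{1}{\sqrt{2\epsilon d K}} \overset{(a)}{=} \frac{1/\sqrt{M}}{\sqrt{2\epsilon K^3 L^2 (N-KL) G_k^2}}, \tag{28} $$

where (a) follows from $d = M(N-KL)(KL)^2 G_k^2$. It is clear from (28) that with fixed $K$, $L$ and $N$, for a desired MSE, $\epsilon$, $\gamma_k(\epsilon)$ decreases as $1/\sqrt{M}$ with increasing $M$. ■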

Remark 8: From Proposition 2 it follows that for a fixed $N$, $K$, $L$ and a fixed desired MSE $\epsilon$, the required SNR $\gamma_k(\epsilon)$ decreases by approximately 1.5 dB with every doubling in the number of BS antennas, $M$. The same behaviour is exhibited by the numerically simulated required SNR (see Fig. 4). We explain this observation in the following. For a fixed $N$, $K$ and $L$ and a fixed desired MSE, $\epsilon$, the variance of $\nu_k^Q$ must also be fixed (see (22)). The variance of $\nu_k^Q$ is given by the R.H.S. of (15), which consists of two terms: the first term is proportional to $1/(M\gamma_k)$ and the second term is proportional to $1/(M\gamma_k^2)$. Therefore, for a fixed variance of $\nu_k^Q$, $\gamma_k$ cannot be reduced faster than $1/\sqrt{M}$ with increasing $M$, since otherwise the second term will increase with increasing $M$. □
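The 1.5 dB figure is simply $10\log_{10}\sqrt{2}$. The sketch below compares it with the drop predicted by the exact expression (23) when $M$ is doubled. The function names are hypothetical, $G_k = 1$ is assumed, and $N = 500$, $K = 10$, $L = 5$, $\epsilon = 10^{-8}$ are example values.

```python
import math

def required_snr(eps, M, K, L, N, G=1.0):
    """Required received SNR from (23), assuming N = B*K*L."""
    B = N // (K * L)
    d = M * (N - K * L) * (K * L) ** 2 * G ** 2
    x = 2 * (B - 1) ** 2 * eps * d / (K * G ** 2)
    return (G / (B - 1)) / (2 * eps * d) * (1 + math.sqrt(1 + x))

def drop_dB(M, eps=1e-8, K=10, L=5, N=500):
    """Reduction in required SNR (in dB) when M is doubled."""
    return 10 * math.log10(required_snr(eps, M, K, L, N)
                           / required_snr(eps, 2 * M, K, L, N))

asymptotic_drop = 10 * math.log10(math.sqrt(2))  # from (28)
drop_160 = drop_dB(160)
drop_large = drop_dB(100000)
```

For moderate $M$ the exact drop is slightly larger than the asymptotic value, because the "1 +" terms in (23) are not yet negligible; as $M$ grows, the drop approaches `asymptotic_drop`.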

$^{8}$Even with increasing $M$, the required $\gamma_k(\epsilon) \gg \gamma_k^0$. This follows from Remark 7 and the fact that $\gamma_k^0 \propto 1/\sqrt{M}$ for fixed $N$, $K$ and $L$ (see the expression for $\gamma_k^0$ in Theorem 1).

Remark 9: When $KL \ll N$ and $M$ is sufficiently large (see (25)), the required SNR $\gamma_k(\epsilon)$ to achieve a desired MSE $\epsilon$ is approximately given by (replacing $(N - KL)$ by $N$ in (28))

$$ \gamma_k(\epsilon) \approx \frac{1}{K^{3/2}\sqrt{2\epsilon M N L^2 G_k^2}}. \tag{29} $$

From (29) it is clear that with fixed $M$, $L$, $N$ and fixed desired MSE $\epsilon$, the required SNR $\gamma_k(\epsilon)$ decreases with increasing $K$ as long as $KL \ll N$.

In the following we explain why the MSE decreases with increasing $K$ (fixed $M$, $L$ and $N$). From (12) we note that the estimate $\hat{\omega}_k$ is $1/KL$ times $\arg(\rho_k)$. The division by $K$ helps in reducing the impact of the noise term in $\rho_k$ (i.e., $\nu_k$, see (10)) on the MSE of the proposed estimate. Note that the division of $\arg(\rho_k)$ by $KL$ is only because the proposed pilot-blocks are spaced $KL$ channel uses apart.$^{9}$ Further, the variance of the noise term in $\rho_k$ (i.e., $\nu_k$) is also a function of $K$ (see (14) and (15)). In both (14) and (15), both the terms on the R.H.S. have $MKL(B-1) = M(N-KL)$ in the denominator. For $KL \ll N$, $(N - KL) \approx N$ and therefore the first term on the R.H.S. of both (14) and (15) does not vary significantly with increasing $K$. However, the second term on the R.H.S. has $1/(2K\gamma_k^2)$ in the numerator, which decreases with increasing $K$.$^{10}$ Therefore the variance of the noise term in $\rho_k$ also decreases with increasing $K$ as long as $KL \ll N$. Finally, the division of $\arg(\rho_k)$ by $K$ and the reduction in the variance of $\nu_k$ with increasing $K$ result in a reduction of the MSE of $\hat{\omega}_k$ with increasing $K$. This automatically implies that for a desired MSE $\epsilon$ the required SNR $\gamma_k(\epsilon)$ decreases with increasing $K$. The same behaviour is also exhibited by the numerically simulated required SNR (see Fig. 5).

In Remark 3 we have seen that the total complexity of the proposed CFO estimator does not increase with increasing $K$ ($N \gg K$). Therefore, with increasing $K$ ($N \gg KL$), the required SNR decreases with no increase in complexity, which is interesting. □
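The trend in Remark 9 can be illustrated by evaluating the exact expression (23) for a few values of $K$ and comparing it with the approximation (29). This is a sketch under assumptions: $G_k = 1$, $M = 160$, $N = 500$, $L = 5$, and $\epsilon = 10^{-8}$ as an example target; the function names are hypothetical.

```python
import math

def required_snr(eps, M, K, L, N, G=1.0):
    """Required received SNR from (23), assuming N = B*K*L."""
    B = N // (K * L)
    d = M * (N - K * L) * (K * L) ** 2 * G ** 2
    x = 2 * (B - 1) ** 2 * eps * d / (K * G ** 2)
    return (G / (B - 1)) / (2 * eps * d) * (1 + math.sqrt(1 + x))

def required_snr_approx(eps, M, K, L, N, G=1.0):
    """Large-M, KL << N approximation from (29)."""
    return 1.0 / (K ** 1.5 * math.sqrt(2 * eps * M * N * L ** 2 * G ** 2))

users = [5, 10, 20]   # each K*L divides N = 500
exact = [required_snr(1e-8, 160, K, 5, 500) for K in users]
approx = [required_snr_approx(1e-8, 160, K, 5, 500) for K in users]
```

Both lists decrease with $K$. The approximation tracks the $K^{-3/2}$ trend but underestimates the exact value, since (29) drops both the "1 +" terms of (23) and the $KL$ correction in $(N - KL)$.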

IV. Numerical Results and Discussions 

In all simulation studies presented in this section, we assume an operating carrier frequency $f_c = 2$ GHz and a maximum CFO of 0.1 PPM of $f_c$ (see footnote 6 in Remark 1). Therefore $|\omega_k| \le \frac{\pi}{2500}$. The communication bandwidth is $B_w = 1$ MHz, the maximum delay spread is $T_d = 5\,\mu$s and the coherence interval is $T_c = 1$ ms. Therefore from the proposed communication strategy in Section II (see Fig. 1), we have $N = 500$ (since $N = N_c/2$ and $N_c = T_c B_w = 1000$) and $L = T_d B_w = 5$. The PDP is the same for each user and is given by $\sigma_{hkl}^2 = 1/L$, $l = 0, 1, \ldots, L-1$, $k = 1, 2, \ldots, K$. Subsequently, for the first user (i.e., $k = 1$) we present the variation in the required SNR (to achieve a fixed desired MSE) as a function of increasing $M$ and $K$.

In Fig. 4, for a fixed $N$, $L$, $K = 10$ and fixed desired MSE $\epsilon = 10^{-8} \ll \omega_1^2$ ($\omega_1 = \pi/2500$), we plot the required SNR $\gamma_1 = \frac{p_u}{\sigma^2}\sum_{l=0}^{L-1}\sigma_{h1l}^2$ as a function of increasing $M$. We plot both the numerically simulated required SNR as well as the analytical expression for the required SNR from (23). From Fig. 4 it can be seen that the analytical expression in (23) is a tight approximation to the exact simulated required SNR. It is also observed that for sufficiently large $M$, the required SNR decreases by roughly 1.5 dB with every doubling in $M$, as is suggested by Proposition 2 (see also Remark 8).

In Fig. 5, for a fixed $M = 160$, fixed $N$, $L$ and a fixed desired MSE $\epsilon = 10^{-8} \ll \omega_1^2$ ($\omega_1 = \pi/2500$), we plot the numerically simulated required SNR $\gamma_1 = \frac{p_u}{\sigma^2}\sum_{l=0}^{L-1}\sigma_{h1l}^2$ as a function of increasing $K$. It is also observed that the analytical approximation to the required SNR (see (23)) is tight. As discussed in Remark 9, the required SNR decreases with increasing number of UTs, $K$.

Fig. 4 Plot of the required SNR to achieve a fixed desired MSE, $\epsilon = \mathbb{E}[(\hat{\omega}_1 - \omega_1)^2] = 10^{-8}$, with increasing number of BS antennas, $M$, for the following fixed parameters: $N = 500$, $K = 10$, $L = 5$, $\omega_1 = \frac{\pi}{2500}$.

Fig. 5 Plot of the required SNR to achieve a fixed desired MSE, $\epsilon = \mathbb{E}[(\hat{\omega}_1 - \omega_1)^2] = 10^{-8}$, with increasing number of UTs, $K$, for the following fixed parameters: $N = 500$, $M = 160$, $L = 5$, $\omega_1 = \frac{\pi}{2500}$.

$^{9}$Note that the correlation $r_m^*[\tau(b,k,l)]\, r_m[\tau(b+1,k,l)]$ between adjacent pilot-blocks leads to the term $KL\,p_u |h_{km}[l]|^2 e^{j\omega_k KL}$.

$^{10}$Note that the second term in the R.H.S. of (14) and (15) is proportional to $1/K$ due to the fact that the pilot used is an impulse which is transmitted once every $KL$ channel uses, and therefore its instantaneous power is $KL$ times higher than the average transmitted power, $p_u$ (see (7)).


Appendix 

Proof of Theorem 1 

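With $\nu_k^I \triangleq \Re(\nu_k)$ and $\nu_k^Q \triangleq \Im(\nu_k)$, from (9) we get

$$ \arg(\rho_k) = \tan^{-1}\left[\frac{G_k \sin(\omega_k KL) + \nu_k^Q}{G_k \cos(\omega_k KL) + \nu_k^I}\right]. \tag{30} $$

Note that $|\omega_k KL| \ll \pi \Rightarrow \cos(\omega_k KL) \approx 1$ (and likewise $\cos(2\omega_k KL) \approx 1$). Using this approximation in (14), we get

$$ \mathbb{E}\left[\left(\frac{\nu_k^I}{G_k}\right)^2\right] \approx \frac{\frac{(2B-3)G_k}{(B-1)\gamma_k} + \frac{1}{2K\gamma_k^2}}{M(B-1)KL\,G_k^2} \overset{(a)}{\ll} 1, \tag{31} $$

where (a) follows from $\gamma_k \gg \gamma_k^0$. Therefore we conclude that $|\nu_k^I|/G_k$ is small (i.e., $|\nu_k^I|/G_k \ll 1$) with high probability. Using this approximation, we can write $G_k \cos(\omega_k KL) + \nu_k^I \approx G_k \cos(\omega_k KL)$. Since $\mathbb{E}[(\nu_k^Q)^2] < \mathbb{E}[(\nu_k^I)^2]$ (compare (14) and (15) with the approximation $\cos(2\omega_k KL) \approx 1$), we can also say that $|\nu_k^Q|/G_k \ll 1$ with high probability. Using the above approximations in (30), we get

$$ \begin{aligned} \arg(\rho_k) &\approx \tan^{-1}\left[\tan(\omega_k KL) + \frac{\nu_k^Q}{G_k \cos(\omega_k KL)}\right] \\ &\overset{(a)}{\approx} \tan^{-1}\left[\tan(\omega_k KL) + \frac{\nu_k^Q}{G_k}\right] \\ &\overset{(b)}{\approx} \tan^{-1}\left[\frac{\tan(\omega_k KL) + \tan\left(\frac{\nu_k^Q}{G_k}\right)}{1 - \tan(\omega_k KL)\tan\left(\frac{\nu_k^Q}{G_k}\right)}\right] \\ &\overset{(c)}{=} \tan^{-1}\left[\tan\left(\omega_k KL + \frac{\nu_k^Q}{G_k}\right)\right] = \omega_k KL + \frac{\nu_k^Q}{G_k}, \end{aligned} \tag{32} $$

where (a) follows from substituting $\cos(\omega_k KL) \approx 1$. Step (b) follows from the fact that since $|\nu_k^Q|/G_k \ll 1$ with high probability, $\frac{\nu_k^Q}{G_k} \approx \tan\left(\frac{\nu_k^Q}{G_k}\right)$ and $\left|\tan(\omega_k KL)\tan\left(\frac{\nu_k^Q}{G_k}\right)\right| \ll 1$ with high probability. Step (c) follows from the standard result $\tan(A + B) = (\tan A + \tan B)/(1 - \tan A \tan B)$. Substituting (32) in (12), we get the desired approximation for the CFO estimate in (19).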
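The quality of the small-noise approximation in (32) can be checked numerically by comparing it with the exact argument in (30). The sketch below is illustrative; the function names and the sample noise values ($\nu_k^I = 0.01$, $\nu_k^Q = 0.02$, $G_k = 1$) are our assumptions.

```python
import math

def arg_rho_exact(omega_KL, G, nu_I, nu_Q):
    """Exact principal argument of rho_k, as in (30)."""
    return math.atan2(G * math.sin(omega_KL) + nu_Q,
                      G * math.cos(omega_KL) + nu_I)

def arg_rho_approx(omega_KL, G, nu_Q):
    """First-order approximation from (32)."""
    return omega_KL + nu_Q / G

omega_KL = math.pi / 50   # omega_k = pi/2500 with K = 10 and L = 5
exact = arg_rho_exact(omega_KL, 1.0, 0.01, 0.02)
approx = arg_rho_approx(omega_KL, 1.0, 0.02)
gap = abs(exact - approx)
```

For these values the two expressions agree to within about $10^{-3}$ radian, and the gap shrinks further as the noise terms become small relative to $G_k$.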